Parallel Data Loader and OLAP Operations
نویسنده
چکیده
Constructing fact tables and performing OLAP operations are some of the most essential but expensive operations in data warehousing. OLAP operations require aggregation on many combinations of each dimension attribute. As the number of dimensions increase, it becomes very expensive to compute summary data cubes because the cost of required computations grows exponentially with the increase of number of dimensions. Query processing for these applications requires different views of data for analysis and effective decision-making. As data warehouses grow, parallel processing techniques can be applied to enable the use of larger data sets and reduce the time for analysis, thereby enabling evaluation of many more options for decisionmaking. Architecture for computing OLAP operations in parallel, using multiple processors is presented in this paper.
منابع مشابه
Do the middle letters of “OLAP” stand for Linear Algebra (“LA”)?
Inspired by pointfree relational data processing, this paper addresses the foundations of an alternative roadmap for parallel online analytical processing (OLAP) based on a separation of concerns: rather than depending on standard database technology and heavy machinery, OLAP operations are performed by encoding data in matrix format and relying thereupon solely on LA operations. The paper inve...
متن کاملA Parallel Scalable Infrastructure for OLAP and Data Mining
Decision support systems are important in leveraging information present in data warehouses in businesses like banking, insurance, retail and health-care among many others. The multi-dimensional aspects of a business can be naturally expressed using a multi-dimensional data model. Data analysis and data mining on these warehouses pose new challenges for traditional database systems. OLAP and da...
متن کاملDesign and Implementation of a Scalable Parallel System for Multidimensional Analysis and OLAP
Multidimensional Analysis and On-Line Analytical Processing (OLAP) uses summary information that requires aggregate operations along one or more dimensions of numerical data values. Query processing for these applications require different views of data for decision support. The Data Cube operator provides multi-dimensional aggregates, used to calculate and store summary information on a number...
متن کاملPARSIMONY: An Infrastructure for Parallel Multidimensional Analysis and Data Mining
Multidimensional analysis and online analytical processing (OLAP) operations require summary information on multidimensional data sets. Most common are aggregate operations along one or more dimensions of numerical data values. Simultaneous calculation of multidimensional aggregates are provided by the Data Cube operator, used to calculate and store summary information on a number of dimensions...
متن کاملIn-memory OLAP aggregation on GPUs using CUDA Dynamic Parallelism
Most queries involved with Online Analytical Processing (OLAP) depend on the functionality of aggregating data along the multidimensional hierarchies of an OLAP cube. In real-time OLAP, aggregated data for interactive operations e.g. roll-up and drill-down is computed on-the-fly. Fast response times are essential and can be accelerated significantly through data-parallel computation on graphics...
متن کامل